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May 17, 2002 (20020517) 

IWATSUKI HIDEKI . . . 
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ABSTRACT 

PROBLEM TO BE SOLVED: To find a desired stock name relatively easily. 

SOLUTION: This system is equipped with a storage means 13 which stores 
factor values by stock names and a processing means 12 which sends a 
rearranging weight input field where rearranging weights are inputted by 
factors to a user terminal 1, computes the evaluation of a stock name 
according to the rearranging weights inputted to the input field and the 
factor values of the stock names, and sends a name list display field 
having been sorted in evaluation order to the user terminal 1. The 
processing means 12 sends an object range setting field wherein minimum 
or/and maximum values are set by the factors to the user terminal 1 
and only stock names within a range limited in the object range setting 
field is regarded as objects of operation for evaluation. 
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ABSTRACT 

PROBLEM TO BE SOLVED: To provide an electronic map retrieval system which 
has high retrieval precision and can correct a display position. 

SOLUTION: This device is provided with a retrieval means 32 which computes 
property data retrieved by decomposing a retrieval key word into elements 
and weighting them, and rates of matching, a candidate plate list display 
means 33 which lists and displays the retrieved property data as candidate 
places in the decreasing order of the matching rates, a map display means 
34 which displays a map according to the candidate place list, a position 
correcting means 35 which corrects the position of a target item 

position mark displayed on the retrieved map, and a correction position 
storage means 23 which stores the corrected position on the map. 
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ABSTRACT 

PROBLEM TO BE SOLVED: To provide an information retrieving device which 
determines the weight of each word so as to retrieve a document matching an 
inputted key word and can retrieve the matching document with stable 
retrieval precision. 

SOLUTION: A document ranking part 22 selects a set of documents matching a 
key word 10 from a document database 25 having retrieval object documents 
and word statistic information 25' included in them. At this time, a 
calculation expression with which weight never becomes minus is used for 
weighting . A word ranking part 23 selects words in a matching 

document D according to relativity, generates a new key word F obtained by 
adding them as key word relative words to the original key word , and 
selects a matching document 30 with the new key word F. 
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ABSTRACT 

PROBLEM TO BE SOLVED: To make it possible to automatically estimate an 
error part by utilizing a text whose error generation inclination has been 
determined . 

SOLUTION: An error candidate detection part 1 includes error candidate word 
group knowledge la and detects error candidate words from an inputted text, 
A co-occurrence information extraction part 2 extracts various co- 
occurrence information corresponding to the error candidate words from the 
inputted text. A control part 3 controls the detection part 1 and the 
extraction part 2 so as to extract co-occurrence information over the whole 
text area. The extracted co-occurrence data are integrated by a statistic 
information totalization part 4 and the appearance patterns of a target 

word in the whole document are totalized. An error judgment part 5 judges 
whether error candidates extracted on the basis of the co-occurrence 



information are correct or not. It is also available to correct the 
knowledge of an error candidate word group by preparing an error statistic 
integration part 6 for integrating statistic information concerned with 
errors or a similarity evaluation part 8 for evaluating the similarity 
of two words in the error word group. 
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or between 



ABSTRACT 

calculate the degree of similarity between polysemy words 
words in different languages by using a word dictionary 
describing corresponding relation between each word and its concept and a 
concept system describing the master-slave relation of concepts and 
independent of languages in respect to a similarity degree calculating 
method for calculating the degree of similarity between words . 

CONSTITUTION: This similarity degree calculating method is provided with 
the word dictionary 1 and the concept system 2, concept groups 
corresponding to plural inputted words are respectively extracted by 
retrieving the dictionary 1, the degree .alpha, of similarity between the 
extracted concept groups is calculated, corresponding master concepts 
(and/or slave concepts) are extracted from these extracted concept groups 
by retrieving the system 2, and the degree .beta, of similarity between the 
extracted master concepts (and/or the slave concepts) is calculated. These 
degrees .alpha., .beta. of similarity are weighted to calculate the 
degree .delta. of similarity as the degree of similarity between words . 
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ABSTRACT 

PURPOSE: To reduce the time and labor for preparation, by changing a frame 
by a ruled mark and the positions of each header and each another 

item for new digit length by changing the digit length of a part of items 
for a picture for inquiry processing. 

CONSTITUTION: In an item digit length change procedure 14, the change of 
the number of rows checked by a change item checking procedure 13 is 
performed by inputting the new digit length from a keyboard input device 4 
by an operator, and the increased/ decreased value of the new digit length 
from present digit length is calculated. Next, in a header position moving 
procedure 15, the header position being displayed in a right direction is 
moved rightward or leftward setting the left end position of a change item 
as a start point by increasing/decreasing by the above increased/decreased 
value. In the next item position change procedure 16, the item 

position being displayed in the right direction is moved rightward or 
leftward setting the left end position of the change item by 
increasing/decreasing by the increased/decreased value. Also, in a ruled 
mark frame change procedure 17, the position of the frame by the ruled mark 
being displayed in the right direction is changed in a form where it is 
moved to the right or left by increasing/decreasing by the 
increased/ decreased value setting the left end position of the change item 
as the start point. 
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Abstract (Basic) : KR 2002009779 A 

NOVELTY - A system and method for providing information on real 
estate is provided to achieve the standardization of valuation for real 
estate by setting up common valuation items, by receiving valuation 
marks from a group assigned to evaluate the items, by processing data 
on the valuation marks in a determined form, and by providing the data 
processed to clients. 

DETAILED DESCRIPTION - An evaluation server (10) includes an input 
unit (11), the first database ( 13 ) , a valuation processing unit (14), the 
second database unit(15) and a transmission unit(12). The input 
unit (11) receives valuation marks on all sorts of evaluation items 

The first database (13) respectively stores data on real estate 
valuation objects. In addition, the first database(13) stores data on 
the valuation marks inputted from the input unit(ll). The valuation 
processing unit (14) estimates the final valuation mark. The second 
database unit (15) stores evaluation -mark data on' each real estate 
valuation object estimated from the valuation processing unit (14). The 
transmission unit (12) processes the valuation mark data of the second 
database unit (15) in a determined form and transmits the data to a 
client computer. 
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Abstract (Basic) : KR 2001092922 A 

NOVELTY - A method for searching a similar web site is provided to 
enable a user to search a web site similar to the web site with the 
user was familiar by enabling a web server to register URLs of web 
sites and search words and compare a URL the user inputs with an 
arbitrary search word, thereby enabling the web server to estimate the 
degree of similarity. 

DETAILED DESCRIPTION - A user wants to search a web site similar to 
a specific web site. The user inputs a URL of the site(10). A web 
server stores search words including rankings and weight values 
and makes the first search word group of the search words (12). The 
user inputs an arbitrary search word to an input window (14). The search 
word the user inputs is compared with the first search word group. 
It is judged whether the search term is repeated ( 16) . In case that 
there's the repeated search word, the search word in the first 
search word group is removed ( 18.) • The search word inputted from, the., 
user is inserted into a relevant ranking(20). The second search word 

group is formed(22). A candidate web site is selected (24 ) . Weight 
values are given to search words in the second search word 
group ( 26) . 
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Target evaluation item position display system for strategic 
planning, displays data in portfolio map, corresponding to. coordinate 
value computed for every extracted data set from management analysis 
database 
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Abstract (Basic) : JP 2001357197 A 

NOVELTY - A management analysis server (14) extracts a specific 
data set from a management analysis database (17) storing data set 
comprising combination of several evaluation value according to various 
references, based on extraction conditions designated by the client 
(2) . The coordinate value is computed for every extracted data set, and 
the data is displayed in a portfolio map, corresponding to the 
coordinate values. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for 

computer readable medium storing target for evaluation item 
position displaying program. 

USE - For displaying position of target evaluation item in 

multidimensional space for strategic planning in enterprise management. 

ADVANTAGE - Performs effective display of position of target 
evaluation item effectively. 

DESCRIPTION OF DRAWING (S) - The figure shows a block diagram of the 
target evaluation item position display system- (Drawing 
includes non-English language text) . 

Client (2) 

Management analysis server (14) 
Management analysis database (17) 
pp; 24 DwgNo 1/27 
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Composite word similarity judging system judges whether each single 
word of divided composite word is included in other composite word, based 
on which similarity between each single word in composite words is 
computed 
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Abstract (Basic) : JP 2001325292 A 

NOVELTY - An acquisition unit (22) produces word information list 
from a storage unit (31) for every single word of composite word which 
is divided into word level stored in a dictionary. A comparison unit 
(23) judges whether each single word of another composite word is 



included in the word list. A decision unit (24) computes the 
similarity of composite words , based on evaluation value calculated 

by comparison result. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 

following : 

(a) Word similarity judging method; 

(b) Recorded medium storing composite word similarity judging 

program 

USE - For judging similarity of composite word . 

ADVANTAGE - Similarity evaluation between composite words is 
performed easily, by computing similarity for every word based on 
predetermined synonym information stored beforehand in word 
relationship dictionary storage unit. 

DESCRIPTION OF DRAWING (S ) - The figure shows the block diagram of 
composite word similarity judging system. (Drawing includes 
non-English language text) . 

Acquisition unit (22) 

Comparison unit (23) 

Decision unit (24) 

Storage unit (31) - • - 
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Abstract (Basic) : WO 200161571 A2 

NOVELTY - Data items stored in a database (15) has several 



associated attributes, which are logically linked to stored values and 
to weight for associated attributes. A search system (20) when input 
with a data item , identifies another data item which has 
attribute, values and associated weights similar with the input 
data items. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for 
database management method. 

USE - For searching and retrieving data from computer database e.g. 
in Internet. 

ADVANTAGE - Linking data attributes with relevant weightings 
increase the precision of search results and the usefulness of orders 
in which search results are presented. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
attribute tagging and matching system of database management system. 

Database (15) 

Search system (20) 
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Abstract (Basic) : WO 200123989 A2 

NOVELTY - Product promotional information is displayed as a set of 



icons to the user and a signal initiated by the user by activating a 
specific icon. Another short-listed information JLs {displayed as another 
set of icons to the user based, on the received/signal. Five words or 
less of text are displayed witrK each icon witj/in both sets of icons as 
a description for promoting products. . / 

DETAILED DESCRIPTION - INDEPENDENT CLALMS are also included for the 

following: \ / 

(a) Price information providing method; 

(b) Digital representation of promotional item providing method; 

(c) Method of using electronic oVdering system; 

(d) Promotional items ordering^ method; 

(e) Price information obtaining/method; 

(f) Item providing method; / \ 

(g) Database updating method / \ 

USE - For providing purchase/ information, items , orders and 

digital representations using electronic ^communication system e.g. 

Internet. / \ 

ADVANTAGE - Orders can be? split between\ suppliers , supplier 
locations, or shipping local/ions. Financial Arrangement for paying for 
orders are highly flexible/and include mixed ^payments > customer access- 
or automatic authorization! notification. Various process segments are 
automated to reduce huma/ interfacing in the ordering process. 

DESCRIPTION OF DRAWING ( S ) - The figure showsNthe illustrations of 
views seen by a user wttL-le accessing information ror promotional items. 
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Density function for query terms for ranking the most relevant 
documents or fragments of documents and for breaking a tie in the 

scoring of order elements of a list 

Patent Assignee: INT BUSINESS MACHINES CORP (IBMC ) 
Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

RD 431186 A 20000310 RD 2000431186 A 20000220 200030 B 

Priority Applications (No Type Date) : RD 2000431186 A 20000220 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
RD 431186 A 1 G06F-000/00 

Abstract (Basic) : RD 431186 A 

NOVELTY - For each element in the hit list, its density function is 
computed, which in its simplest form is the reciprocal of the number of 

terms between the first and last terms which is part of the 'bag 
of words' submitted to a query. The larger the density, the higher a 
document should be ranked. The density function can be used to rank 
order elements of a list which have been scored equally by a different 
ranking algorithm and can also be used with other algorithms. 

USE - Determining a set of documents satisfying a specific query. 

ADVANTAGE - Percolating most relevant documents and passages to top 
of the hit list. 
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Food order fulfillment checking apparatus in quick service restaurant 
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Patent Family: \ / 
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US 5937386 A 19990810 US 96762038 A / 19961209 199943 B 
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US 5937386 A 6 G06F-153/oV) / 

Abstract (Basic) : US 5937386 A \ / 

NOVELTY - A food order comprising different customer selected menu 
items is input into a computer through a point of sale register and to 
a food preparation area. The computer calculates and compares a total 
target weight specification forv ordered items to total weight of 
bagged items by executing different programs, to verify proper 
fulfillment of the order. / \ 

DETAILED DESCRIPTION - The /f ood\preparation area includes two 
substations for placing bagged orders, in waiting' queue and weighing 
bagged orders, respectively. /The tota\ target weight specification for 
the ordered food items ii calculated based on predetermined weight 
specifications of each ordered food Ytem - Separate program 
functions are designed for /execution by Vhe processor of the computer 
for target weight specification calculation, storage of food order dat 
and comparison between target and actual weights of bagged food items. 
An INDEPENDENT CLAIM is aAso included for oVder fulfillment checking 
method. / \ 

USE - For checking order fulfillment of bagged food items in quick 
service restaurant. / \ 

ADVANTAGE - As the computer executed specially designed program 
function for checking contents of bagged food iVems, the restaurant 
personal are alerted by the computer system on detecting overweight or 
underweight condition of the packaged order, therfeby preventing 
delivery of incomplete order to a customer and placing a wrong item in 
the bag of outgoing order. As the apparatus has a sVmple structure to 
check fulfillment of an order, the apparatus is easy to install, 
operate, service and maintain and detects order inaccuracy to overcome 
inaccurate filling of order without extrahandling and heavy investment 

DESCRIPTION OF DRAWING (S) - The • figure shows ' the " diagrammatic ' 
layout of food order fulfillment checking method. 

pp; 6 DwgNo 1/1 

Title Terms: FOOD; ORDER; CHECK; APPARATUS; QUICK; SERVICE; RESTAURANT 
Derwent Class: T01; T05 

International Patent Class (Main) : G06F-153/00 
File Segment: EPI 



22/5/16 (Item 10 from fileWSSO) 

DIALOG (R) File 350: Derwent WPIX /\ 

(c) 2004 Thomson Derwent. Alj/rtsV reserv. 



011047902 **Image available** 

WPI Acc No: 1997-025826/199703 

XRPX Acc No: N97-021641 

Shared screen cursor shape switching method using information processing 
appts - includes step to erase directions mark shape/of sub interactive 
person currently displayed when similar mark of other sub interactive 
person is operated 

Patent Assignee: HITACHI LTD (HITA ) A HITACHI SO^fWARE ENG CO LTD (HISF ) 
Number of Countries: 001 Number of Parents: 001 
Patent Family: 

Patent No Kind Date Applicat n)d Kirfd Date Week 
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Priority Applications (Na Type. Date) JP ^5^2205 -A 19950418 
Patent Details: 
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JP 8286802 A 6 G06F-003/02 



Abstract (Basic) : JP 8286802 A 

The shared screen cursor shape/ switching method uses a direction 
mark of suitable shape for each interactive person. If the mark points 
to a specific point, the direction mark of \ the second speaker who 
specified the same point on eachf shared screen is displayed. The main 
interactive person always operates a direction mark. 

Regarding other sub interactive person, k specified sub interactive 
person is not allowed to operatte a directions\ mark while others operate 
the same. When the first interactive position terminates the 
operation of a direction marl! it will be erased when other sub 
interactive person attempts to operate the directions mark. 

ADVANTAGE - Shares direction mark by common Information processing 
system. Simplifies communi caption among different person. 
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Computer based text classification method for large organisations - by 
parsing input text and deriving similarity scores for categories in 
knowledge base, thresholds then applied discarding least similar to 
keyword set 
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Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 5371807 A 20 G06K-009/72 

Abstract (Basic) : US 5371807 A 

The method involves accepting as input natural language text, which 
is parsed into a first list of recognized keywords. 



The first list is used to deduce further facts from the input text. 
The deduced facts are compiled into a second list. A numeric 
similarity score for categories in the knowledge base, with the 
natural language text is calculated. 

A dynamic threshold is applied to determine which of the categories 
are most similar to the recognized keywords of the first list. This 
is based upon a similarity score of a most similar category, and a 
predefined threshold offset. The categories are based upon their 
similarity scores by discarding categories whose similarity 
scores are below the threshold value. The categories determined to be 
most similar in the previous step are compiled into a third list. The 
first list, the second list and the third list are passed to an 
external application. 

USE /ADVANTAGE --.For e.g. database searches. High -accuracy- of - 
method. High performance especially over time. 
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Method of using electronic dictionary - involves preliminarily defining 
letter weight numbers - individually corresp. to - alphanumeric • symbols and 
position weight numbers individually corresp. to letter positions in 
any word 
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Priority Applications (No Type Date): JP 8615342 A 19860127; JP 85215691 A 
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US 5297038 A 9 G06F-015/40 Cont of application US 86912530 

Cont of application US 88186721 
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Abstract (Basic) : US 5297038 A 

The method involves defining the letter weight and position 
weight numbers and obtaining a work-representing hash value for each 
of given words to be stored by sequentially calculating 
letter-representing hash values of individual letters of each given 
word from both the letter and position weight numbers respectively 
corresponding to the individual letters and a predefined divisor and by 
adding the sequentially calculated letter-representing hash values. The 
obtained work-representing hash values are then sequentially arranged 
in numerical order separately for even-letter words and odd-letter 
words . 



The method further involves dividing each of the word-representing 
hash values into an index section represented by high digits of it and 
a data section represented by lower digits of it. Groups of the 
word-representing hash values are formed according to the index 
sections separately for even-letter words and odd-letter words. Numbers 
of the work-representing has values are then stored in groups. 
Addresses of the first word -representing hash values are also 
calculated and stored in the groups. 

ADVANTAGE - Electronic dictionary which can be searched efficiently 
for e.g. spell-check purpose. 
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Microcomputer system data transfer controller - uses comparison of 
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Abstract (Basic) : EP 110199 A 

Method for controlling the data transfer between a data transmitter 
and a data receiver on a bus, using a control unit which is connected 
to the bus and which controls the data transfer in dependence on a 
channel program stored in a memory and' 

containing transfer commands and control commands, characterised in 
that, after termination or interruption of the processing of a transfer 
command, the cause identifying the termination or the interruption of 
the data transfer is set in corresponding register positions (BC, ET, 
MA, VE) of a first register ( RG1 ) , that, in order to determine whether 
a conditional control command (ST) following the transfer command is to 
be executed, the register positions allocated to the first register 
(RG1) are written in from the conditional control command (ST) in a 
second register 

(RG2 ) , and that the mutually associated register positions of the 
first and second register are compared with one another and the 
conditional control command is executed with a positive result of the 
comparison and the command of the channel program following the 



* conditional control command is executed with a negative result of the 
comparison . 
(13pp) 

DE 3241402 A 

The controller is coupled to the data bus to control data transfer 
according to a stored channel programme containing a transfer command 
and a control command. The transfer command is processed and the end 
result is used for selecting a register position in a first register. 
The control command is then entered in a given position in a second 
register corresponding to the position of the first register. 

The two register positions are compared via an operation 
multiplexer of a sequencer control which provides the control command 

or the following command after this in the channel programme. 

0/5 ' ..... 
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Transmitter for data between memory stores - has variable word data 
recorded in state marked off by positioning codes 
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Priority Applications (No Type Date) : JP 75125870 A 19751021 

Abstract (Basic) : US 4103334 A 

A device for controlling the handling of data has a main memory for 
storing a large number of record data items in a state marked off by 
record-positioning codes. The data items are formed of series- arranged 

words of variable length, each containing characters marked off by 
word - positioning codes. A processing memory of matrix type is formed 
of columns containing words each having characters falling within a 
specified range, and rows, and stores at least one record data item 
read-out of the main memory. 

Characters constituting the foremost word data item are written in 
a row. Upon detection of a word positioning code immediately 
following the foremost word data item, the succeeding word data item is 
written in an adjacent row. Read-out of data items from the main 
1 memory is ended when a record-positioning code following the last word 
data item of each row' is detected. 
Title Terms: TRANSMIT; DATA; MEMORY; STORAGE; VARIABLE; WORD; DATA; RECORD; 
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Abstract- Previous work addressing the issue of word distribution in 
documents (D. Beeferman et al . , 1997; S.M. Katz, 1996) has shown the 
importance of word repetitiveness as an indicator of the word 
content-bearing characteristics. The authors propose a simple method using 
a measure of the tendency of words to repeat within a document to separate 
the words with similar document frequencies, but different topic 
discriminating characteristics. We describe the application of the new 
measure in query-document relevance scoring - Experiments on the TREC Ad 
Hoc and Spoken Document Retrieval tasks (E.M. Voorhees and D . Harman, 1999) 
show useful performance improvements. (8 Refs) 
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Abstract: Name extraction is indispensable for both natural language 
understanding and information retrieval. However, proper names are major 



unknown words in natural language texts, and unknown word identification is 
still a challenging problem in natural language processing. We deal with 
identification of person names, organization names and location names from 
Chinese texts. Different types of information from different levels 
of text are employed, including character conditions, statistical 
information, titles, punctuation marks , organization and location 
keywords , speech-act and locative verbs,, cache and n-gram model. We also 
clarify which strategies can be used in which cases, i.e., queries and/or 
documents. In our experiments, the recall rates and the precision rates for 
the extraction of person names, organization names, and location names 
under MET data are (87.33%, 82.33%), (76.67%, 79.33%) and (77.00%, 82.00%), 
respectively. (23 Refs) 
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Abstract: Significant changes in exis 
the information superhighway. Change 
looking at the Internet (a convem 
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titutions are resulting from 
vitable, and I offer a way of 
proxy for whatever form the 
some of the changes that it is 
nternet is a mechanism for 
an intermediary between the 
does not explain everything 
that is happening, but it will fielp to Identify why some existing 
institutions will fail as a res/Qlt, and Why some individuals are so 
uncomfortable with the new technology. The ^ ew examples I give do not 
exhaust the disintermediative e/fects of the Internet. Banks and banking 
services are undergoing more chan/es because of interactive communications. 
Some jobs have already been changed by telecommuting. It seems likely that 
there will be significant restructuring of education as well. There may be 
similar repercussions for /marketers , retailers and many other 
institutions. Students and yUsers of modern computer networks would do 
well to consider the effects ft disintermediation wh&n assessing the future 
consequences of the Internet. In contemplating theV social, political and 
economic effects of networks, it is important to understand what is being 
changed and how existing institutions are likely to be affected. Whoever 
you are and whatever you do, there is a good chance that there will be some 
disintermediation in your future. How you adjust to it and whether it is 
good or bad will be up to you. It is not too early to prepare. (0 Refs) 
Subfile: C 

Descriptors: Internet; politics; socio-economic effects 
Identifiers: disintermediation; Internet; information superhighway; 

change; intermediary removal; institutional failure; banking services; 

interactive communications; telecommuting; educational restructuring; 



marketers; retailers; computer ne tweaks/ social effects; political effects; 

economic effects; future 

Class Codes: C0230 (Economic, socLaXand political aspects of computing) 
; C5620W (Other computer networks ); ~e72 10\J Information services and 

centres ) 

Copyright 1996, IEE 



22/5/15 (Item 4 from file: 6) 

DIALOG (R) File 6: NTIS 

(c) 2004 NTTS, Intl Cpyrght All Rights Res. All rts . reserv. 

0800912 NTIS Accession Number: PB80-119506/XAB 

An Evaluation of Factors Affecting Document Ranking by Information 

Retrieval Systems 

(Final rept . 1 Sep 78-29 Feb 80) 

McGill, M. J. 

Syracuse Univ., NY. School of Information Studies. 
Corp. Source Codes: 013939059 

Sponsor: National Science Foundation, Washington, DC. Div. of Information 
Science and Technology. 
Oct 79 129p 
Languages: English 
Journal Announcement: GRAI8007 

Order this product from NTIS by: phone at 1-800-553-NTIS (U.S. 
customers); (7 03)605-6000 (other countries); fax at (703)321-8547; and 
email at orders@ntis.fedworld.gov. NTIS is located at 5285 Port Royal Road, 
Springfield, VA, 22161, USA. 

NTIS Prices: PC A07/MF A01 . - • 

Country of Publication: United States 

Contract No.: NSF-IST78-10454 

The study examines ranking algorithms used in a Boolean environment. The 
ranking algorithms are decomposed into term weighting schemes and 
similarity measures. Representative term weights and similarity 

measures are selected- from those known to exist in information retrieval 
environments. The, ranking algorithms are tested using documents submitted 
by specific clearinghouses to the Current Index to Journals in Education 
data bases. The study used information need statements from individuals 
with interests congruent with the data base. After searches were conducted 
by professional searchers, the retrieved documents were judged for 
relevance by the persons submitting the original information need 
statement. The algorithms were analyzed according to their ability to move 
relevant documents toward the beginning of the output list. The study found 
that when using a controlled vocabulary or the free text, it is possible to 
significantly improve the order of the output. The results also indicate 
that ranking is at best about 20 percent effective with the remaining 80 

percent not yet resolved. 
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Abstract: In this paper, we present a novel technique that can be used 
for fast similarity-based indexing and retrieval of both image and video 
databases in distributed environments. We assume that image or video 
databases are stored in the compressed form using standard techniques such 
as JPEG for images, and M-JPEG or MPEG for videos. The existing 
techniques, proposed in the literature, use computationally intensive 
features and cost functions for content-based image and video retrieval 
and indexing. The proposed algorithm uses an innovative approach based on 
histograms of DC coefficients only, and therefore is computationally 
less expensive than the other approaches. In the case of a JPEG-compressed 
image database, the query process is the following. The user submits a 
request for search -by-similarity by presenting the desired image. The 
algorithm calculates the DC coefficients of this image and creates the 
histogram of DC coefficients. Then, the algorithm compares the DC 
histogram of the submitted image with the DC histograms of the images 
stored in the database using a histogram similarity metric. The image 
database can be local or at a remote _ server . In . our experiments we 
compared several histogram similarity metrics : weighted Euclidean 
distance, square difference, and absolute difference. The algorithm then 
selects and presents to the user the images with the smallest values of 
the metric that best match the submitted image. In the case of a 
compressed video database, the similarity-based indexing and retrieval is 
more complex. The manipulation of a .video database consists of three main 
operations: (1) partitioning of the video into clips, (2) key frame 
extraction, and (3) indexing and retrieval of key frames. The proposed 
algorithm has been applied in all three steps. First, the DC histograms 
are implemented for partitioning each video into clips or camera shots. 
Then, in the next phase the same DC histograms are used to extract key 
frames and create a database of key frames only. Finally, in the last step, 
the user submits one or more video frames that he/she is searching for. 
We implemented the described algorithm for similarity-based retrieval to 
both image and video databases. The experimental results, presented in the 
paper, show that the proposed algorithm can be very efficient for 
similarity-based search of images and videos in distributed environments, 
such as Internet, Intranets, or local-area networks. 7 Refs. 
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The constraint database model has recently been proposed as a 
promising framework to model continuous variables and therefore images and 
spaces. Constraint databases are being increasingly used to solve spatial 
problems. This dissertation presents further advances on the use of the 
constraint data model in the context of similarity retrieval. 

Similarity-based retrieval is an important task in many spatial and 
image database applications. Given a representation scheme, it is necessary 
to employ some measure to determine the visual or spatial similarity of the 
objects. Spatial similarity is complex due to the numerous constraining 
properties of spatial objects and their embedding in space. We propose a 
new method to measure the similarities between two 2-dimensional spatial 
scenes that are composed of spatial objects, which are a set of points, 
line segments or polygons. 

We also investigate high-level database change operations that allow 
users to be concerned only with the new information they want to insert 
into the database and to leave the burden of resolving inconsistencies 
between the new and the old data to the database system. The principle of 
minimal change states that the result of adding the new information to a 
database should be the set of models of the new information that is closest 
to some possible models in the current database. Constructing similarity 
measure between spatial scenes allows us to study the change operations for 
constraint databases in terms of model-theoretic minimal change. We define 
several change operators based on the proposed similarity measure, and 
study the characteristics of these operators. 

Algorithms that implement the similarity queries and change 
operators are introduced. Analysis of the computational complexities of 
the algorithms shows that an efficient PTIME evaluation of similarity 
queries and several types of change operations in linear constraint 
databases is possible. We also describe and analyze experimental 
investigations on how well" the similarity measures agree with human 
intuition. Finally, we describe the implementation of similarity queries 
within the MLPQ/GIS system. 
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Previous work addressing the issue of word distribution in documents 
shows the importance of word repetitiveness as an indicator of the word 
content-bearing characteristics. Proposes a simple method using a measure 
of the tendency of words to repeat within a document to separate the words 

with similar document frequencies, but different topic 
discriminating characteristics. Describes the application of the new 
measure in query-document relevance scoring . Experiments on the TREC Ad 
Hoc and Spoken Document Retrieval tasks show useful performance 
improvements . 
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The study examines ranking algorithms used in a boolean environment. The 
ranking algorithms are decomposed into term weighting schemes and 
similarity measures. Representative term weights and similarity 
measures are selected from those known to exist in information retrieval 
environments. The ranking algorithms are tested using documents submitted 
by specific clearinghouses to the current index to journals in education 
data bases. The study used information need statements from individuals 
with interests congruent with the data base . After searches . were conducted, 
by professional searchers, the retrieved documents were judged for 
relevance by the persons submitting the original information need 
statement. The algorithms were analyzed according to their ability to move 
relevant documents toward the beginning of the output list. The study found 
that when using a controlled vocabulary or the free text, it is possible to 
significantly improve the order of the output. The results also indicate 
that ranking is at best about 20 percent effective with the remaining 80 
percent not yet resolved 
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